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Abstract — In  this  paper,  we  present  an  autonomous  cognitive 
radio  (CR)  architecture  that  incorporates  the  main  features  of 
cognition.  This  model,  referred  to  as  the  Radiobot,  is  capable 
of  self-learning  and  self-reconfiguration  to  match  its  RF  environ¬ 
ment.  The  proposed  CR  architecture  assumes  a  joint  blind  energy 
and  cyclostationary  detection  methods  to  classify  the  communi¬ 
cation  systems  in  its  vicinity,  without  any  prior  knowledge  of  the 
sensed  signals.  We  derive  the  receiver  operating  characteristic 
(ROC)  of  the  energy  detector  and  show,  analytically,  the  impact 
of  the  sliding  window  length  on  the  energy  detection.  A  learning 
algorithm  is  proposed,  allowing  the  Radiobot  to  independently 
learn  from  its  past  experience  in  order  to  optimize  its  operating 
parameters.  By  applying  the  learning  algorithm  to  the  sensing 
module,  we  verify,  through  simulations,  the  convergence  of  the 
proposed  algorithm  to  the  optimal  solution. 

I.  Introduction 

Inspired  by  the  concept  of  robots  in  mechanical  systems, 
a  next  generation  of  autonomous  cognitive  radio  (CR)  archi¬ 
tecture  called  the  Radiobot  was  proposed  in  [1],  The  notion 
of  Radiobots  goes  beyond  the  current  trend  of  CR  which 
primarily  aims  at  achieving  dynamic  spectrum  sharing  (DSS). 
In  [1],  the  authors  laid  out  a  broader  and  more  ambitious 
notion  of  autonomous  radio  devices  that  are  capable  of  inde¬ 
pendently  finding  and  joining/avoiding  any  radio  network  in 
their  vicinity  in  order  to  achieve  the  best  possible  performance 
in  any  given  RF  environment.  To  achieve  this,  one  of  the 
most  important  requirements  for  the  Radiobot  is  to  be  aware 
of  its  RF  environment  [1],  More  particularly,  the  Radiobot 
should  be  able  to  identify/classify  different  activities  in  its 
vicinity  by  using  both  effective  and  efficient  sensing,  feature 
extraction  and  learning  methods.  For  instance,  by  identifying 
the  types  of  active  communication  systems  and  the  number 
of  users  associated  with  each  system,  it  makes  it  possible  for 
a  Radiobot  to  adjust  its  communication  mode  to  communi¬ 
cate  efficiently  while  avoiding  harmful  interferences,  such  as 
jammers.  Moreover,  by  knowing  the  number  of  users  in  each 
sub-band,  the  Radiobot  can  estimate  the  available  spectrum 
resources,  which  helps  to  implement  efficient  spectrum  sensing 
and  communication  policies. 

Given  the  importance  of  machine  learning  and  reconfig- 


urable  hardware  in  the  design  of  the  Radiobots  [1],  we 
propose,  in  this  paper,  a  learning  algorithm  that  allows  the 
Radiobot  to  improve  its  sensing  techniques  and  adapt  its 
design  parameters  based  on  its  past  experience.  This  allows 
realizing  the  Observe-Decide-Act-Learn  (ODAL)  cognition 
cycle  of  [1]  and  leads  to  the  design  of  a  real  cognitive  engine 
(CE). 

Different  learning  algorithms,  such  as  the  reinforcement 
learning  (RL)  [2],  have  previously  been  applied  to  CR  ap¬ 
plications  to  achieve  autonomous  cognitive  behavior.  The  RL 
is  an  unsupervised  learning  algorithm  that  allows  an  agent  to 
independently  learn  from  past  experience  by  trial-and-error 
[2].  This  algorithm  was  applied  for  interference  control  in  CR 
networks  in  [3]  and  for  deriving  distributed  Medium  Access 
Control  (MAC)  protocols  for  CR’s  in  [4].  The  RL  is  suitable 
for  Markovian  environments  in  which  it  can  assure  certain 
optimal  behavior  [2],  In  our  case,  however,  since  we  do  not 
assume  any  underlying  Markovian  framework,  we  propose 
a  different  unsupervised  learning  algorithm  that  allows  the 
Radiobot  to  self-reconfigure  its  operating  parameters.  Our 
proposed  algorithm  is  similar  to  that  in  [5]  and  is  intended  to 
optimize  the  spectrum  sensing  thresholds  in  order  to  achieve 
a  desired  false  alarm  probability.  According  to  [5],  the  false 
alarm  probability  is  updated  during  a  training  period  in  which 
no  signals  are  present.  In  contrast,  our  proposed  algorithm 
does  not  require  a  training  period  and  can  update  the  cy¬ 
clostationary  detector  parameters  during  normal  operation, 
whenever  no  signals  are  detected  by  the  energy  detector. 
Moreover,  the  proposed  learning  algorithm  does  not  assume 
any  prior  knowledge  of  the  observed  data,  and  it  minimizes  the 
Kullback-Leibler  distance  [5]  between  the  desired  and  actual 
performance  measures.  Due  to  the  convexity  of  the  Kullback- 
Leibler  function,  the  proposed  algorithm  is  guaranteed  to 
converge  to  the  optimal  solution,  as  verified  in  the  simulations. 

In  contrast  with  previously  proposed  MAC  sensing  pro¬ 
tocols  which  assume  prior  knowledge  of  either  the  primary 
channels  [6]  or  the  cyclostationary  properties  of  the  sensed 
signals  [7],  our  proposed  Radiobot  architecture  assumes  no 
such  knowledge.  Instead,  it  uses  a  blind  joint  energy  and 
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cyclostationary  detection  method  without  any  prior  informa¬ 
tion  to  identify  the  carrier  frequencies  and  cyclostationarity 
properties  of  active  signals.  The  performance  of  the  carrier 
frequency  detector  is  evaluated  through  its  Receiver  Operating 
Characteristic  (ROC).  Based  on  the  ROC,  we  show  analytically 
that  the  carrier  frequency  detection  can  be  improved  by  using 
energy  detection  with  a  sliding  window.  This  result  was 
previously  shown,  only  through  simulations,  in  [8],  Moreover, 
in  this  paper,  we  develop  a  feature  extraction  algorithm  that 
allows  determining  both  the  types  and  the  number  of  users 
at  each  carrier  frequency.  Several  signal  classification  and 
feature  extraction  methods  have  been  proposed  in  the  liter¬ 
ature,  including,  for  example,  the  model  in  [9]  which  uses 
support  vector  machines  (SVM’s).  In  this  paper,  however,  we 
employ  non-parametric  learning  for  autonomous  signal  iden¬ 
tification/classification.  In  non-parametric  approaches,  models 
are  extracted  from  the  structures  present  in  the  data  itself. 
Similarly  to  [10],  we  classify  the  extracted  features  based 
on  a  Chinese  restaurant  process  (CRP).  The  proposed  feature 
extraction/classification  and  learning  are  validated  through 
simulations. 

The  remainder  of  this  paper  is  organized  as  follows:  Section 
II  defines  the  system  model.  Section  III  describes  the  Ra- 
diobot’s  learning  mechanism.  We  show  the  simulation  results 
in  Section  IV  and  conclude  the  paper  in  Section  V. 

II.  System  Model 

The  proposed  Radiobot  architecture  is  aimed  to  operate  in 
a  wideband  spectrum,  which  is  split  into  N  sub-bands  that 
are  sensed  sequentially  in  a  round-robin  style.  To  illustrate 
the  same  operating  procedures  in  each  and  every  sub-band, 
we  consider  a  single  sub-band  and  denote  by  N(t)  the  total 
number  of  signals  at  time  t  in  this  particular  sub-band.  Prior 
to  sampling,  the  received  signal  is  down-converted  to  interme¬ 
diate  frequency  (IF)  by  using  a  local  oscillator  with  frequency 
fj.  The  corresponding  IF  signal  y(t)  can  be  expressed  as 
y{t)  =  3?{E +n(t)  , 
where  *  denotes  the  convolution,  Xk(t)  denotes  the  baseband 
signal  of  user  k  using  carrier  frequency  fCk,  and  hk{t)  denotes 
the  baseband  equivalent  channel  impulse  response  of  the 
channel  between  the  user  k  and  the  Radiobot.  The  receiver 
noise  is  denoted  by  n(f),  which  is  assumed  to  be  a  white 
noise  process  with  double-sided  power  spectral  density  (PSD) 
of  Ff-  The  average  noise  power  at  the  output  of  the  sweeping 
IF  filter  will  thus  be  Pn  =  — (E  where  B  is  the  IF  filter 
bandwidth.  The  resulting  signal-to-noise  ratio  (SNR)  at  the 
output  of  the  IF  filter  is  SNR  =  Ja,  where  Ps  is  the  received 
signal  power.  We  denote  by  y  =  [y(l),  •  •  •  ,y(M)]T  an  M- 
length  vector  of  the  sampled  discrete-time  IF  signal,  such 
that  y{k)  =  y(kTs),  for  k  €  {l,---  ,Mj,  where  Ts  is  the 
sampling  period. 

A.  Identification  of  RF  Activities 

We  apply  a  cyclostationarity-based  signal  detection  to  the 
sensed  vector  y  in  order  to  identify  the  RF  activity  in  the 
sub-band  of  interest.  We  identify  the  active  signals  by  their 


carrier  frequencies  and  the  associated  cyclic  frequencies  that 
are  induced  by  their  symbol  rates  and  coding  schemes  [11], 
By  using  the  discrete-frequency  smoothing  method  described 
below  [  12],  we  compute  an  estimate  of  the  Spectral  Correlation 
Function  (SCF)  Sf(t.  f  )  for  a  general  discrete  signal  {x(t  — 
/cTs)}Eq\  for  each  sub-band.  Firstly,  the  corresponding  FFT 
at  time  t  is  defined  as  [12]: 

M- 1 

X(t,  f)=Y l  a(*  -  kTs)x(t  -  kT3)e~j2nf<-t~kTs\  (1) 

k= 0 

where  a(t)  is  a  triangular  data  tapering  window.  The  FFT 
X(t,f)  is  defined  over  the  set  of  frequencies  {— 4^,—  + 
Fs,--'  ,  4  ^  Fs,  where  fs  =  ^  and  Fs  =  is  the 
frequency  increment.  An  approximation  to  the  SCF  is  [12]: 


S?  (*./)  = 


(L- 1)/2 

E  - 

i/=-(L-l)/2 


rv  ry 

X{t,  f+—+vFs)X*(t,  f——+vFs), 


where  T  =  MTS  is  the  time  length  of  the  data  segment,  a 
is  the  cyclic  frequency  and  L  (an  odd  number)  is  the  spectral 
smoothing  window  length.  By  setting  a  =  0,  we  may  obtain  an 
estimation  of  the  PSD  of  the  discrete  signal  {x(t  —  kTg)}^1^1: 


Y,  X(t,f  +  uFs) 


v=-(L-l)/2 


The  above  PSD  is  equivalent  to  the  sliding  window  energy 
detector  that  is  proposed  in  [8].  The  authors  in  [8]  showed, 
through  simulations,  that  sliding  window  improves  the  perfor¬ 
mance  of  energy  detectors.  In  our  paper,  however,  we  derive 
in  (10)  of  the  Appendix  the  analytical  ROC  of  the  sliding 
window-based  energy  detector  as  a  function  of  the  smoothing 
window  length  L  and  prove  that  larger  sliding  windows  lead 
to  higher  detection  probability,  as  illustrated  in  Fig.  1. 

The  active  carrier  frequencies  in  a  certain  sub-band  are 
determined  by  setting  a  threshold  on  the  above  PSD.  The 
threshold  ypsD  is  based  on  the  Neyman-Pearson  optimality 
and  is  derived  in  (9)  of  the  Apppendix.  The  carrier  frequencies 
are  estimated  as  the  midpoints  of  the  segments  formed  by  in¬ 
tersection  between  the  PSD  curve  and  the  threshold  line  tjpsd- 
We  denote  by  2fc  the  set  of  all  detected  carrier  frequencies. 

Next,  an  estimate  of  the  spectral  autocoherence  function 
magnitude  [11],  [12]  is  computed  as  follows: 


I  CS(t,  f)  I  =  \SS(t,f)\  (3) 

yJS2{t,f  +  a/2)S2(t,f-a/2) 

We  denote  the  cyclic  sub-domain  profile  of  carrier  frequency 
fc  C  5c  as  Ix(t,  a,  fc)  ma x/£[/c— a/i,,/0+a/(/]  (^>/)l> 

where  fc  —  A  fp  and  fc  +  A  fjj  are  the  lower,  and  upper  limit 
of  the  considered  frequency  range  for  carrier  frequency  fc, 
respectively. 

In  [11],  it  is  shown  that  digital  signals  exhibit  cyclostation¬ 
arity  at  multiples  of  their  symbol  rates.  Moreover,  the  digital 
signals  may  exhibit  other  periodicities  as  well:  For  example, 
due  to  coding.  We  denote  the  RF  signature  of  the  signal 
centered  at  fc  as  RF(/C)  =  {a  ^  0  :  3EIx(t,a,fc)  >  £}, 
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where  0 e  denotes  the  indicator  function  of  event  E  = 
{Ix(t,a,  fc)  is  a  local  maximum},  and  £  £  (0,1).  It  is  hard 
to  derive,  analytically,  an  optimal  choice  for  the  parameter  £ 
due  to  the  complexity  of  the  cyclic  sub-profile  function  and 
the  environment  randomness.  However,  as  we  show  in  Section 
III,  we  apply  a  learning  algorithm  to  optimize  the  value  of  £ 
for  any  unknown  environment. 

II.  Feature  Extraction  Method 

We  define  a  feature  extraction  method  that  helps  to  estimate 
the  number  of  distinct  signals  at  each  carrier  frequency.  This 
method  estimates  the  number  of  periodic-peaks  sequences  in 
each  cyclic  sub-profile  that  is  centered  at  carrier  frequency 
fc.  According  to  the  periodicity  properties  of  the  cyclic  sub- 
profiles,  if  a  signal  exhibits  a  cyclic  component  at  a  certain 
cyclic  frequency  a,  then  it  also  exhibits  cyclic  components 
at  integer  multiples  of  a.  Thus,  each  periodic-peaks  sequence 
corresponds  to  at  least  one  communication  signal  type.  By 
detecting  the  number  of  periodic-peaks  sequences  in  a  certain 
cyclic  sub-profile,  we  may  determine  the  minimum  number  of 
distinct  communication  signals  that  are  transmitted  simultane¬ 
ously  at  each  carrier  frequency1.  Each  of  those  detected  signal 
is  represented  by  its  carrier  frequency  fc  and  its  fundamental 
cyclic  frequency  component  a,  thus  defining  a  2-D  feature 
space  for  identifying  the  minimum  number  of  users  at  each 
carrier  frequency2.  The  detected  features  will  be  classified  in 
a  2-D  feature  space  based  on  the  CRP,  similar  to  [10]. 

III.  Self-Reconfiguration  of  the  Spectrum  Sensing 

Module 

The  performance  of  the  Radiobot  is  related  to  the  quality 
and  accuracy  of  the  sensing  observations.  It  is  required  to 
optimize  the  sensing  module  so  that  it  best  estimates  the  RF 
activity  in  the  surrounding  environment.  Several  parameters 
may  need  to  be  optimized  during  the  sensing  process,  such 
as  the  sensing  duration,  detector  thresholds,  spectrum  sensing 
policies,  etc.  based  on  the  particular  RF  environment  it  encoun¬ 
ters  at  a  given  time.  Due  to  its  learning  and  reasoning  abilities, 
the  Radiobot  can  dynamically  adapts  these  parameters  based 
on  its  past  experience.  To  be  specific,  assume  that  the  Radiobot 
needs  to  optimize  its  cyclic  sub-profile  threshold  £  such  that 
it  achieves  a  certain  false  alarm  probability.  It  is  very  hard 
to  obtain  analytical  solutions  to  this  problem  due  to  the 
complexity  of  the  cyclic  profile  equation  and  to  the  uncertainty 
in  the  surrounding  environment.  A  possible  solution  is  to  learn 
the  optimal  threshold  value  iteratively  based  on  the  sensing 
observations,  as  in  [5]. 

An  online  learning  algorithm  was  proposed  in  [5]  to  adapt 
the  threshold  value  of  Neyman-Pearson  test  when  the  prob¬ 
ability  distribution  of  the  detected  signals  is  unknown.  The 
threshold  is  thus  dynamically  updated  to  achieve  a  desired 

1  Including  primary  users,  interferers  and  cognitive  secondary  users. 

2The  feature  space  can  be  extended  to  higher  dimensions  to  account 
for  other  signal  features,  such  as  the  direction  of  arrival.  This  allows  to 
discriminate  between  simultaneous  transmissions  having  the  same  carrier 
frequency,  symbol  rate  and  coding  rate. 


false  alarm  probability.  The  learning  process  is  conducted 
during  a  training  period  in  which  the  observed  data  are 
drawn  from  a  null  hypothesis.  In  our  case,  however,  we 
do  not  assume  a  training  period  and  we  propose  a  learning 
algorithm  that  updates  the  cyclic  sub-profile  threshold  £  during 
the  normal  operation  time  itself  to  achieve  a  desired  false 
alarm  probability  0.  By  the  help  of  the  energy  detection, 
the  learning  algorithm  identifies  the  absence  of  transmitted 
signals  to  perform  the  learning  process.  The  objective  of 
the  learning  algorithm  is  to  minimize  the  Kullback-Leibler 
distance  K(P\\Q)  between  two  probability  distributions  P  and 
Q,  similar  to  [5],  where: 

*(P|IC)=£P(i)log§§. 

We  denote  by  P  and  Q  the  desired  and  actual  probability 
distributions  of  the  cyclostationary  detector  output,  condi¬ 
tioned  on  the  absence  of  transmitted  signals.  These  proba¬ 
bility  distributions  correspond  to  Bernoulli  random  variables, 
representing  whether  a  signal  is  (1)  or  is  not  (0)  detected. 
By  defining  0  and  Pf(C)  as  the  desired  and  actual  false 
alarm  probabilities  (for  a  given  threshold  £),  respectively,  the 
Kullback-Leibler  distance  can  then  be  expressed  as: 

K{P\\Q)  =  K(d>,  Pf(  £))  =  </>  log  ^+(l-</>)  log  xl_^{0 

The  Kullback-Leibler  distance  P/(£))  Is  a  non" 

negative  quantity  representing  the  difference  between  0  and 
P/(£),  with  K[f>,  Pf(())  =  0  if  and  only  if  <fi  =  P/(£). 
Due  to  its  convexity  in  P/(£),  the  Kullback-Leibler  distance 
guarantees  a  global  minimum  and  makes  a  perfect  candidate 
for  convex  optimization  problems.  Moreover,  it  was  shown  in 
[5]  that  K((f>,  P/(C))  is  convex  in  £  if  and  only  if  P/(£)  is 
monotonous,  which  is  satisfied  in  our  case.  However,  since  the 
analytical  expression  of  P/(£)  is  unknown,  it  can  be  estimated 
as  the  ratio  of  sample  points  that  exceed  the  threshold  £  in 
the  cyclic  profile  I  {a),  when  there  is  no  transmitted  signals. 
As  noted  in  [5],  to  achieve  accurate  estimate  for  P/(£),  the 
recursive  adaptation  in  £  should  not  be  too  frequent.  This 


Algorithm  1  Learning  algorithm  to  control  the  cyclic  sub- 
profile  threshold  £ 

Initialize:  counter  =  1. 

while  No  signal  is  detected  by  the  energy  detector  do 

Update  the  false  alarm  probability  P/(£)  and  counter  = 
counter  +  1. 
if  counter  =  Nc  then 

Update  £  such  that:  £<—£  +  '£>  (P/(£)  —  </>)• 

Reset  counter  =  1. 
end  if 
end  while 


is  taken  into  account  in  the  proposed  learning  algorithm 
(Algorithm  1),  in  which  the  threshold  £  is  updated  after  each 
Nc  >  1  updates  of  the  false  alarm  probability  P/(£)-  The 
value  of  Nc  is  selected  such  that  it  achieves  convergence  of 
the  estimated  false  alarm  probability  P/(£). 
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Fig.  1 .  The  impact  of  the  sliding  window  length  L  on  the  ROC  of  the  earner 
frequency  detector. 

The  update  rule  in  Algorithm  1  minimizes  the  Kullback- 
Leibler  function  since  it  follows  a  gradient  descent  direction 
that  reduces  the  difference  |-P/(C)  —  <p\  at  a  learning  rate  of 
if)  >  0.  Moreover,  due  to  the  convexity  of  the  Kullback-Leibler 
function,  this  algorithm  is  guaranteed  to  converge  to  a  unique 
optimal  threshold  value. 

IV.  Simulation  Results 

First,  we  show,  in  Fig.  1,  the  performance  comparison 
between  the  sliding  window-based  ( L  >  1)  and  conventional 
(L  =  1)  energy  detections.  The  ROC’s  of  Fig.  1  show  that 
higher  detection  probabilities  can  be  achieved  for  larger  sliding 
window  length  L. 

Next,  in  order  to  verify  the  proposed  signal  classification 
and  learning  techniques,  first,  we  assume  two  arbitrary  users 
that  are  operating  in  the  sensed  sub-band  of  interest.  These 
users  are  assumed  to  transmit  simultaneously  at  a  WiFi  channel 
centered  at  fi  =  2.43767/ z'.  Their  transmit  signals  are  sensed 
and  downconverted  to  an  intermediate  frequency  (IF)  by  using 
a  local  oscillator  of  2.3567/7  (chosen  such  that  the  IF  signal 
can  be  sampled  using  a  realistic  ADC).  We  assume  that  the  first 
user  uses  a  symbol  rate  of  14Mbauds  and  the  second  system 
uses  12Mbauds  with  a  coding  rate  of  1/2.  In  each  sensing 
period  of  12/rs,  the  feature  points  (/c,a)  are  detected  and 
classified  into  clusters  based  on  the  CRP  [10].  Figure  2  shows 
the  clusters  after  20  sensing  periods.  We  notice  two  major 
clusters  (in  bold  circles)  which  occur  with  a  probability  higher 
than  0.1.  These  clusters  are  distributed  around  fc  =  87MHz, 
representing  the  carrier  frequency  after  IF  conversion.  The 
cluster  with  a  =  6MHz  corresponds  to  the  signal  of  12Mbauds 
with  a  coding  rate  of  1/2,  and  the  other  cluster  at  a  =  14MHz 
corresponds  to  the  uncoded  signal  of  14Mbauds. 

Next,  we  verify,  in  Fig.  3,  the  convergence  of  the  learning 
algorithm  proposed  in  Section  III.  We  let  <f>  to  be  the  desired 
false  alarm  probability  of  the  cyclostationary  detection  and  let 
(  be  the  control  threshold.  Starting  from  (f  =  0,  Algorithm 
1  converges  to  constant  threshold  at  which  the  actual  false 
alarm  probability  -P/(C)  converges  to  <p.  The  learning  rate 
is  set  to  ip  =  0.2  and  the  threshold  (f  is  updated  after 
each  Nc  =  20  updates  of  the  false  alarm  probability  Pf( </). 


Fig.  2.  Detection  and  CRP-based  classification  of  simultaneous  system 
transmissions  at  a  single  earner  frequency. 

Learning  curves  of  the  cyclic  sub-profile  threshold  C,  and  the  false  alarm  probability  Pf  with  \y=  0.2 


Fig.  3.  Learning  curves  of  the  cyclic  sub-profile  threshold  £  and  the  false 
alarm  probability  Pf  with  a  learning  rate  'ip  =  0.2  and  a  desired  false  alarm 
probability  cp. 

This  result  shows  that  the  proposed  learning  algorithm  can 
accurately  reconfigure  the  Radiobot’s  parameters  and  helps 
achieving  the  desired  performance  levels. 

V.  Conclusion 

In  this  paper,  we  have  proposed  an  autonomous  sensing 
architecture  for  a  futuristic  autonomous  CR,  referred  to  as  the 
Radiobot  [1].  The  proposed  system  applies  a  blind  cyclosta¬ 
tionary  detection  method  to  identify/classify  the  active  users 
in  the  RF  environment.  An  ROC  was  derived  showing  that 
energy  detection  can  be  improved  by  using  sliding  window 
averaging  techniques.  We  developed  an  unsupervised  learning 
mechanism  that  allows  the  Radiobot  to  learn  from  its  past 
experience  and  to  self-reconfigure  its  sensing  parameters. 
We  showed,  through  simulations,  that  the  proposed  learning 
algorithm  optimizes  the  Radiobot’s  parameters  in  order  to 
achieve  a  desired  objective. 
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Appendix 

Derivation  of  the  Threshold  and  the  ROC  for 
Carrier  Frequency  Detection 

Consider  a  sampled  data  sequence  {^Wlfclo1’  with  Ts  as 
the  sampling  period.  We  denote  by  {V(n)}raL"^1  its  discrete 
Fourier  transform  (DFT)  obtained  by  FFT: 

M  —  l 

X(n)  =  x{k)e~^2nn'M ,  for  n  =  0,  •  •  •  ,  M  —  1.  (4) 

k= o 

The  average  power  in  a  spectral  window  of  odd  length 
L,  centered  at  n,  can  be  approximated  by  T(n)  = 
1)/2  |X(n  +  /)|2.  In  order  to  derive  the  receiver  oper¬ 
ating  characteristic  (ROC)  of  the  Neyman-Pearson  detector,  we 


TC0  :  x(k)  =  w(k),  (5) 

TCi  :  x(k)  =  s(k)  +  w(k),  (6) 


where  {■u>(/c)}fclo1  are  m°deled  as  i.i.d.  Gaussian  random 
variables,  s.t.  w(k)  ~  ZNT(0,  Pn).  The  signal  {s(fc)}^f=g1  in 
(6)  can  be  modeled  as  i.i.d.  Gaussian  random  variables,  s.t. 
s(k)  ~  N(0,Ps).  This  is  a  reasonable  assumption  for  signals 
that  are  perturbed  by  propagation  through  turbulent  media  and 
multipath  fading  [13].  It  is  well-known  that  the  energy  detector 
is  the  optimal  detector  for  unknown  (random)  signals. 

Let  x  =  [a;(0),  ■  •  ■  ,  x(M  -  1)]T,  X  =  [X(0),  •  ■  •  ,  X(M  - 
1)]T,  XR  =  [3?{X(0)},---  {X(M  -  1)}]T  and  X7  = 
[3{X(0)},  •  •  •  ,  3{X(M— 1)}]T, ,  where  5R{}  and  Ss{}  denote 
the  real  and  imaginary  parts,  respectively.  The  DFT  in  (4)  can 
be  expressed  as: 


Xc  A 


X7 


=  Ax  , 


(7) 


where  A  is  a  2M-by-M  matrix  of  DFT  coefficients.  Since 
are  zero-mean  i.i.d.  Gaussian  random  variables, 
then  Xc  is  a  jointly  Gaussian  random  vector.  It  can  be  shown 
that,  under  fKo,  E{XC  (X.C)T}  =  AIPn  Im  (where  Im  is  an 
M-by-M  identity  matrix)  and  under  Jfi  E{XC  (XC)T}  = 
M(ps+p,n)  im .  Therefore,  elements  of  Xc  are  uncorrelated. 
Since  Xc  is  jointly  Gaussian  with  uncorrelated  elements,  the 
elements  of  Xc  are  then  independent.  Also,  since  all  the 
elements  have  the  same  variance  under  the  same  hypothesis, 
elements  of  X<:'  are  assumed  to  be  i.i.d.  zero-mean  Gaus¬ 
sian  random  variables  with  variance  AIPn  under  Jfo,  and 
M(Pn2+Pa^  under 

Under  the  above  assumptions,  T'(n)  =  fp  T(n)  is  a 
sufficient  statistic  for  the  hypothesis  testing  and  follows  a 
distribution.  The  threshold  77  for  carrier  frequency  detection 
is  defined  s.t.  Pr{T'(n)  >  ^Tfo}  <  aF,  where  qj?  is 
the  acceptable  false  alarm  probability.  Note  that  the  noise 
power  Pn  can  be  estimated,  for  example,  by  using  the  method 
proposed  in  [14]. 

The  Neyman-Pearson  decision  rule  S  for  carrier  frequency 
detection  is  then  defined  as: 


0  if  T'{n )  <  7 
1  otherwise 


(8) 


where  rj  =  27-1  (L;  (1  —  otp)  F(L)),  7_1  is  the  inverse  lower 
incomplete  gamma  function  (where  7 (k;  x)  =  fQ  tk~1e~tdt 
and  the  inverse  is  w.r.t.  the  second  argument)  and  F(fc)  = 
f0  tk~1e~tdt  is  the  gamma  function.  By  applying  this  to  the 
PSD  in  (2),  the  threshold  is  given  by: 


Vpsd  = 


yPn 
2  TSL 


'r-1{L-,{l-qF)T(L))Pn 

TaL 


(9) 


The  resulting  detection  probability  of  this  detector  can  be 
expressed  as: 


PD  =Pr{T’,(n)  >  7|Jfi}  =  1- 


2(l+SiVfl) 


r  (L) 


GO) 


which  represents  the  ROC  of  the  carrier  frequency  detector. 
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